A Semantic Kernel to Exploit Linguistic Knowledge

نویسندگان

  • Roberto Basili
  • Marco Cammisa
  • Alessandro Moschitti
چکیده

Improving accuracy in Information Retrieval tasks via semantic information is a complex problem characterized by three main aspects: the document representation model, the similarity estimation metric and the inductive algorithm. In this paper an original kernel function sensitive to external semantic knowledge is defined as a document similarity model. This semantic kernel was tested over a text categorization task, under critical learning conditions (i.e. poor training data). The results of cross-validation experiments suggest that the proposed kernel function can be used as a general model of document similarity for IR

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Chinese Hedge Scope Detection Based on Structure and Semantic Information

Hedge detection aims to distinguish factual and uncertain information, which is important in information extraction. The task of hedge detection contains two subtasks: identifying hedge cues and detecting their linguistic scopes. Hedge scope detection is dependent on syntactic and semantic information. Previous researches usually use lexical and syntactic information and ignore deep semantic in...

متن کامل

M ODELS by Tong Wang A thesis submitted in conformity with the requirements for the degree of Doctor of Philosophy

Exploiting Linguistic Knowledge in Lexical and Compositional Semantic Models Tong Wang Doctor of Philosophy Graduate Department of Computer Science University of Toronto 2016 A fundamental principle in distributional semantic models is to use similarity in linguistic environment as a proxy for similarity in meaning. Known as the distributional hypothesis, the principle has been successfully app...

متن کامل

Induction of Classifiers through Non-Parametric Methods for Approximate Classification and Retrieval with Ontologies

This work concerns non-parametric approaches for statistical learning applied to the standard knowledge representations languages adopted in the Semantic Web context. We present methods based on epistemic inference that are able to elicit and exploit the semantic similarity of individuals in OWL knowledge bases. Specifically, a totally semantic and language independent semi-distance function is...

متن کامل

The Manifestation Challenge: The Debate between McDowell and Wright

In this paper, we will discuss what is called “Manifestation Challenge” to semantic realism, which was originally developed by Michael Dummett and has been further refined by Crispin Wright. According to this challenge, semantic realism has to meet the requirement that knowledge of meaning must be publically manifested in linguistic behaviour. In this regard, we will introduce and evaluate John...

متن کامل

Linguagrid: a network of Linguistic and Semantic Services for the Italian Language

In order to handle the increasing amount of textual information today available on the web and exploit the knowledge latent in this mass of unstructured data, a wide variety of linguistic knowledge and resources (Language Identification, Morphological Analysis, Entity Extraction, etc.). is crucial. In the last decade LRaas (Language Resource as a Service) emerged as a novel paradigm for publish...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005